Data quality and data cleaning in database applications
نویسنده
چکیده
........................................................................................................................................ II Acknowledgement........................................................................................................................ IV Publications from the PhD work ................................................................................................... V Table of
منابع مشابه
Cleaning uncertain data with quality guarantees
Uncertain or imprecise data are pervasive in applications like location-based services, sensor monitoring, and data collection and integration. For these applications, probabilistic databases can be used to store uncertain data, and querying facilities are provided to yield answers with statistical confidence. Given that a limited amount of resources is available to “clean” the database (e.g., ...
متن کاملAcademic Statement by Leopoldo Bertossi
(A) Data Management and Business Intelligence. Specific areas of interest and research have been: (a) Inconsistency management in databases. (b) Virtual data integration. (c) Multidimensional databases, in particular semantics problems and their impact on OLAP and data analytics. (d) Peer data exchange. (e) Contexts for data management. (f) Data quality assessment and data cleaning, in particul...
متن کاملOntology-Based Multidimensional Contexts with Applications to Quality Data Specification and Extraction
Data quality assessment and data cleaning are context dependent activities. Starting from this observation, in previous work a context model for the assessment of the quality of a database was proposed. A context takes the form of a possibly virtual database or a data integration system into which the database under assessment is mapped, for additional analysis, processing, and quality data ext...
متن کاملQuality Assurance of Government Databases
Data cleaning is a vital process that ensures the quality of data stored in real-world databases. The process of identifying the record pairs that represent the same entity (duplicate records), commonly known as record linkage, is one of the essential elements of data cleaning. Digital government serves as an emerging area for database research, such as database management, data integration, da...
متن کاملBiological data cleaning: a case study
As databases become more pervasive through the biological sciences, various data quality concerns are emerging. Biological databases tend to develop data quality issues regarding data legacy, data uniformity and data duplication. Due to the nature of this data, each of these problems is non-trivial and can cause many problems for the database. For biological data to be corrected and standardise...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2012